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H ABSTRACT 



We use the two-degree field Galaxy Redshift Survey (2dFGRS) to test the hierarchical 
scaling hypothesis: namely, that the p-point galaxy correlation functions can be written 
in terms of the two point correlation function or variance. This scaling is expected if 
an initially Gaussian distribution of density fluctuations evolves under the action of 
gravitational instability. We measure the volume averaged p-point correlation functions 
using a counts in cells technique applied to a volume limited sample of 44,931 
galaxies. We demonstrate that L* galaxies display hierarchical clustering up to order 
p = 6 in redshift space. The variance measured for L* galaxies is in excellent agreement 
with the predictions from a A-cold dark matter N-body simulation. This applies to all 
cell radii considered, 0.3 < (R/h~ 1 Mpc) < 30. However, the higher order correlation 
functions of galaxies have a significantly smaller amplitude than is predicted for 
the dark matter for R < 10/i _1 Mpc. This disagreement implies that a non-linear bias 
exists between the dark matter and L* galaxies on these scales. We also show that 
the presence of two rare, massive superclusters in the 2dFGRS has an impact on the 
higher-order clustering moments measured on large scales. 

Key words: cosmology: observations, large-scale structure of Universe 



1 INTRODUCTION 

Current theoretical models of structure formation in the 
Universe are based on the paradigm of gravitational instabil- 
ity. This process is believed to be responsible for driving the 
growth of small primordial density perturbations into the 



nonlinear collapsed structures such as galaxies and clusters 
that are evident in the Universe today. 

The premise of gravitational instability has been tested 
indirectly by comparing the clustering predicted by numeri- 
cal simulations of the formation of cosmic structures with 
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the observed distribution of galaxies (e.g. Benson et al. 
2001). A direct test of this fundamental ingredient of struc- 
ture formation models was made using the 2dFGRS by Pea- 
cock et al. (2001). The size of the 2dFGRS allowed the first 
accurate measurement of the two-point galaxy correlation 
function on large scales. Peacock et al. demonstrated that 
the two-point correlation function at large pair separations 
displays a form that is characteristic of the bulk motions of 
galaxies expected in the gravitational instability scenario. 

We present an independent test of the gravitational in- 
stability paradigm. For a Gaussian distribution of density 
fluctuations, the volume averaged correlation functions, f p , 
are identically zero for p > 2; the density field is completely 
described by its variance, £2- The evolution of an initially 
Gaussian density field due to gravitational instability gen- 
erates non-zero £ p (Peebles 1980). A basic test of the gravi- 
tational origin of the higher order moments is to determine 
their relation to the variance of the distribution. This is tra- 
ditionally encapsulated in the hierarchical model: 

£p = S p ^2 ■ (1) 

This model applies to real space clustering; however, in red- 
shift space the scaling still tends to hold even on small 
scales where the "fingers-of-God" effect is prominent (La- 
hav et al. 1993; Hoyle, Szapudi & Baugh 2000). Perturba- 
tion theory predicts that the hierarchical amplitudes for the 
mass distribution are independent of the cosmological den- 
sity parameter, the cosmological constant and cosmic epoch 
(Bernardeau et al. 2002). 

We use the 2dFGRS (Colless et al. 2001, 2003) to mea- 
sure the higher order correlation functions of the galaxy 
distribution, focusing on the clustering of L, galaxies. The 
size of the 2dFGRS is exploited to extract a volume limited 
sample of L, galaxies, which greatly simplifies our analysis 
(Section 2). The results for the volume averaged correlation 
functions, up to sixth order, are presented in Section 3, in 
which we also test how well the hierarchical scaling model 
works. Our conclusions are given in Section 4. 



2 DATA AND ANALYSIS 

The density of galaxies is a strong function of radial distance 
in a magnitude limited survey. This needs to be compen- 
sated for in any clustering analysis by applying a suitable 
weighting scheme (e.g. Saunders et al. 1991). Alternatively, 
one may construct a volume limited sample by selecting cer- 
tain galaxies from the full redshift survey. These galaxies are 
chosen so that they would appear inside the apparent mag- 
nitude range of the survey if displaced to any redshift within 
the interval defining the sample. The only radial variation in 
galaxy number density in a volume limited sample is due to 
large scale structure in the galaxy distribution. This makes 
volume limited samples much more straightforward to anal- 
yse than flux limited samples. However, only a fraction of 
the galaxies from the full redshift survey satisfy the selection 
criteria in redshift and absolute magnitude. This reduction 
in the density of galaxies has curtailed the utility of volume 
limited subsamples constructed from earlier redshift surveys. 

We construct a volume limited sample of L* galaxies 
from the 2dFGRS. The motivation for the choice of a sam- 
ple centred on L» is clear; this results in a volume limited 



sample with the largest possible number of galaxies for mag- 
nitude bins of a given size. As the luminosity used to define 
a sample increases, the selected galaxies can be seen out 
to larger redshifts and thus sample larger volumes. How- 
ever, brighter than L», the space density of galaxies drops 
exponentially (e.g. Norberg et al. 2002). Hence, the opti- 
mum balance between volume surveyed and intrinsic galaxy 
space density is achieved for L* galaxies. In addition, the 
higher order clustering of i* galaxies provides a benchmark 
or reference against which to compare trends in clustering 
strength with galaxy luminosity (see Norberg et al 2001; 
Croton et al. 2004a). We consider the two contiguous areas 
of the 2dFGRS, referred to as the NGP and SGP regions, 
which contain around 190,000 galaxies with redshifts and 
cover an effective area of approximately 1200 square degrees 
in total. After selecting galaxies with absolute magnitudes in 
the range —19 > Mi, T — 51og 10 h > —20 (corrected to z = 
using the global k + e correction quoted by Norberg et al. 
2002), the volume limited sample contains 44,931 galaxies. 
The redshift interval of the sample is z — 0.021 to 0.130, 
corresponding to a volume of 7.97 x 10 6 /i~ 3 Mpc 3 for the 
combined NGP and SGP regions. 

2.1 Counts in cells analysis 

The distribution of counts in cells is estimated by throwing 
down a large number of spherical cells, on the order of 10 7 
for each cell radius considered, within the L* volume limited 
2dFGRS sample. Full details of how we deal with the spec- 
troscopic incompleteness and the angular mask are given in 
Croton et al. (2004a); the corrections turn out to be small 
in any case (see figure 1 of Croton et al.). 

The higher order correlation functions, £ p , are the re- 
duced p th order moments of the distribution of galaxy counts 
in cells. The estimation of the higher order correlation func- 
tions from the cell count probability distribution is ex- 
plained in a number of papers (e.g. Gaztafiaga 1994; Baugh, 
Gaztanaga & Efstathiou 1995; Croton et al. 2004a). The 
variance or width of the count distribution is given by the 
case p — 2. For p > 2, the correlation functions probe further 
out into the tail of the count probability distribution. 

We use mock 2dFGRS catalogues to estimate the errors 
on the measured higher order correlation functions. Full de- 
tails of the mocks can be found in Norberg et al. (2002) and 
Croton et al. (2004a). 



3 RESULTS 

The projected density of galaxies in the L* volume limited 
sample is shown in Fig.Q The galaxy density projected onto 
the right ascension-redshift plane is smoothed using circular 
windows. Two different smoothing radii have been used to 
produce these maps; the left-hand panel shows the density 
after smoothing with a circular cell of radius 15/i _1 Mpc and 
the right hand panel shows the distribution as sampled with 
a cell of radius 3ft -1 Mpc. The redder colours indicate higher 
galaxy densities, as shown by the scale that accompanies 
each cone plot. Two 'hot-spots' are readily apparent, partic- 
ularly in the cone plot smoothed on the larger scale. These 
correspond to superclusters of galaxies that also appear in 
the 2dFGRS Percolation Inferred Galaxy Group catalogue 
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Figure 1. The projected galaxy density in the L» volume limited sample, smoothed on two different scales. The galaxy distribution is 
projected onto the right ascension-redshift plane and is then smoothed in circular cells of radius 15/i — x Mpc (left) and 3h~ x Mpc radius 
(right). Redder colours denote higher densities as indicated by the key that accompanies each panel. Two "hot-spots" stand out clearly, 
particularly in the left hand coneplots; one in the NGP at z ss 0.08 and the other in the SGP at 2 fs 0.11. Right ascension is given in 
radians. 



(Eke et al. 2004) and in the reconstructed density field of 
the 2dFGRS (Erdogdu et al. 2004). The presence of these 
superclusters has an impact on the extreme event tail of the 
count probability distribution. Later in this section, we will 
investigate the influence of these structures on our measure- 
ment of the higher order correlation functions by excising 
the volumes that contain the superclusters from our analy- 
sis. The 'cosmic web' of filamentary structures and voids is 
apparent in the cone smoothed on the smaller scale. 

The higher order correlation functions measured for L* 
galaxies are plotted in Fig. |2] The correlation functions are 
only plotted on scales for which a robust measurement is 
possible. The correlation functions show a dramatic steep- 
ening on small scales as the order p increases. For example, 
the ratio £ 6 /£ 2 is 10 5 at R = l/i _1 Mpc, falling to ~ 100 
at R — 6.3/i -1 Mpc. We also plot the higher order correla- 
tion functions for the dark matter distribution in the ACDM 
Hubble Volume simulation (Evrard et al. 2002). These the- 
oretical predictions include the effects of peculiar motions 
in the distant observer approximation. The variance of the 
dark matter in redshift space agrees spectacularly well with 
the measured £2 for L* galaxies. This confirms the conclu- 
sions reached in independent analyses of the clustering of 
L* galaxies in the 2dFGRS (Lahav et al. 2002; Verde et al. 
2002). However, for the case of the ACDM Hubble Volume 
simulation, the p > 2 moments of the dark matter differ 
from the measurements for L, galaxies for R < 10fo -1 Mpc. 

The hierarchical amplitudes, S p , obtained from the £ p 
by applying Eq. are plotted as a function of cell radius 



for orders p = 3-5 in Fig. |3j (p = 6 is omitted for clar- 
ity). For p — 3, S3 is approximately constant for cells with 
R < 3/i _1 Mpc. At larger R, S3 increases with radius. This 
behaviour is mirrored for p > 3, with the upturn in S p 
seen at progressively smaller radii as p increases. Pertur- 
bation theory predicts that, on large scales, the S p should 
have only a weak dependence on scale for CDM-like power 
spectra (Juszkiewicz, Bouchet & Colombi 1993). In redshift 
space, the hierarchical amplitudes are expected to be ap- 
proximately independent of scale over an even wider range 
of scales than those on which perturbation theory is applica- 
ble (Hoyle et al. 2000; Bernardeau et al. 2004). We therefore 
attempt to fit a constant value of S p to the ratios plotted in 
Fig. [3] We use a principal component analysis to take into 
account the correlation between the £ p in neighbouring bins 
(e.g. Porciani & Givalisco 2002; for further details of our 
implementation see Croton et al. 2004a). The results of this 
analysis are given in Table 1. In Fig. El the horizontal lines 
show the best fit constant value for S p , fitted over the scales 
0.71 < (R/h~ 1 Mpc) < 7.1. The same range of scales is used 
to fit the S p for each order p. (The choice of scales is set by 
the cell radii for which a reliable measurement of ^6 is possi- 
ble.) The dotted lines indicate the la uncertainty on the fit. 
The errorbars plotted in Fig. 3 show only the diagonal com- 
ponent of the full covariance matrix. The amplitudes S p are 
extremely correlated, with the principal component analysis 
showing that the first few eigenvectors contain the bulk of 
the variance, indicating that there are typically just 2 or 3 
independent points. Sample variance leads to measurements 
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Table 1. The best fit values for S p and the 2-a error (Ax 2 = 4), 
obtained using the measurements for cell radii in the range 0.71 < 
(i?/h~ 1 Mpc) < 7.1. The 2— a errors are approximately twice the 
size of the 1— a errors. The final column gives the reduced \ 2 
using the number of degrees of freedom derived from the principal 
component analysis. 
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which could be coherently shifted either low or high with 
respect to a fixed value. This therefore drives the best fit 
value of S p to lie either below or above a sizeable fraction of 
the data points. For the L* sample, we note that neither S3 
nor S4 are particularly well described by a constant fit (see 
the reduced \ 2 values in Table 1). 

For purely illustrative purposes, we have carried out 
the experiment of removing the two superclusters from the 
L, volume limited sample and repeating our measurement 
of the higher order correlation functions. The corresponding 
results for the hierarchical amplitudes are plotted using open 
symbols in Fig. [3] The upturn in the S p values at large radii 
is no longer apparent. Rather than being considered as a cor- 
rection, the results of this exercise simply serve to show the 
influence of the supercluster regions on our measurements of 
the £ p . Where the difference matters, it effectively indicates 
that the volume of even the 2dFGRS is too small to yield a 
robust higher-order clustering measurement. A further dis- 
cussion of this test is given by Croton et al. (2004a). 

Armed with the best fit values of S p , the hierarchical 
model stated in Eq.^can be used to make predictions for the 
form of the higher order correlation functions and compare 
these with the measurements from the 2dFGRS L, galaxy 
sample (symbols in Fig. [I] reproduced from Fig. The 
solid lines in Fig. 0] show the £ p predicted from the hier- 
archical scaling relation (Eq.0, assuming a constant value 
for the hierarchical amplitudes, S p , and using the measured 
variance, {2- The dotted lines show the uncertainty in the 
theoretical predictions, derived from the 1— a error in the 
fitted values of the S p and the error on the measured vari- 
ance, £2- The theoretical predictions for the different orders 
agree spectacularly well with the measured higher order cor- 
relation functions over the range of scales for which the S p 
are fitted. 



4 CONCLUSIONS 

We have measured the higher order correlation functions of 
L* galaxies up to sixth order in the 2dFGRS. Previous stud- 
ies of galaxy clustering in redshift space have been limited to 
fourth order (e.g. for optically selected samples: Gaztanaga 
1992; Benoist et al. 1999; Hoyle et al. 2000: for infra-red se- 
lected samples: Bouchet et al. 1993; Szapudi et al. 2000). The 
volume limited sample of L, galaxies analysed here contains 
100 times more galaxies and covers 10 times the volume of 
the previous best measurements from an optically selected 
galaxy redshift survey (Hoyle et al. 2000). The measured 
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Figure 2. The higher order correlation functions £p measured 
for Lt galaxies in the 2dFGRS (symbols). The orders p = 2- 
6 are shown, as indicated by the key. The errorbars show the 
rms scatter estimated using mock 2dFGRS catalogues. The lines 
show the §p measured for the dark matter in redshift space in 
the ACDM Hubble Volume simulation, for orders p = 2 to 6 in 
sequence of increasing amplitude for R < lO/i^Mpc. 
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Figure 3. The hierarchical amplitudes, S p , for p = 3,4 and 
5, plotted as a function of cell radius for the L* volume limited 
sample. The filled symbols connected by solid lines show the re- 
sults obtained using the full volume. The best fit constant values 
of S p are shown by the horizontal solid lines, which are plotted 
over the range of scales used in the fit. The dotted lines show the 
1— a error on the fit. The open symbols connected by dashed lines 
show the hierarchical amplitudes recovered when the two largest 
superclusters are masked out of the volume. 
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Figure 4. The higher order correlation functions, § p , measured 
for L* galaxies in the 2dFGRS (symbols, as in Fig. 2) compared 
with the predictions of the hierarchical model (Eq. solid lines). 
The hierarchical predictions are plotted only on the scales used to 
fit S p . The dotted lines indicate the errors on these predictions, 
with contributions from the error on the fitted value of S p and 
on the measured variance £2- 



correlation functions have a form that is in remarkably good 
agreement with the predictions of hierarchical scaling, and 
extend to smaller scales than those for which the perturba- 
tion theory predictions are expected to be valid (Bernardeau 
et al. 2002). A similar conclusion was reached by Croton et 
al. (2004b), who found hierarchical scaling in the reduced 
void probability function measured in the 2dFGRS. 

On scales larger than about 4/i _1 Mpc, there is an up- 
turn in the values of S p , which we have demonstrated is 
influenced by the presence of two large superclusters in the 
2dFGRS (see Fig. 3). This suggests that finite sampling af- 
fects our measurements on these scales. A similar feature was 
found in the angular Edinburgh-Durham Southern Galaxy 
Catalogue (EDSGC). Szapudi & Gaztanaga (1998) found 
that the projected S p measured from the EDSGC displayed 
an up-turn for scales larger than 0.5 degrees, which corre- 
sponds to ~ 4/i _1 Mpc at the characteristic depth of the 
survey. The EDSGC covers a similar part of the sky to the 
2dFGRS. This feature in S p was not found, however, in the 
APM Survey, which covers a four times larger solid angle 
than the EDSGC (Gaztanaga 1994). This behaviour is not 
seen in the mock catalogues drawn from the ACDM Hubble 
Volume simulation. Intriguingly, an upturn in the hierar- 
chical amplitudes on large scales is expected in structure 
formation models with non-Gaussian initial density fields 
(Gaztanaga & Fosalba 1998; Bernardeau et al. 2002). 

Finally, we note that the variance of the distribution 
of cell counts for L* galaxies is in excellent agreement with 
the predictions for CDM, obtained from the Hubble Volume 
ACDM simulation, which includes the effects of peculiar mo- 
tions on the clustering pattern. However, for cells with radii 



R < 10/i _1 Mpc the higher order correlation functions of L* 
galaxies have significantly lower amplitudes than the dark 
matter. This implies that the relation between the distribu- 
tion of galaxies and the underlying dark matter may be more 
complicated than the popular linear bias model, suggesting 
that nonlinear contributions to the bias may be important 
on small and intermediate scales (Fry & Gaztanaga 1993; 
see also the analyses by Conway et al. 2004 and Wild et al. 
2004). We note that on large scales (R > 10ft _1 Mpc), the £ p 
measured for L* galaxies agree better with the ACDM pre- 
dictions, supporting the conclusion reached previously, that 
on these scales, L* galaxies approximately trace the mass 
distribution (Gaztanaga & Frieman 1994; Lahav et al. 2002; 
Verde et al. 2002). 

We explore the distribution of galaxy counts in cells 
for the 2dFGRS in more detail in Croton et al. (2004a), 
where we study the dependence of the correlation functions 
on luminosity. 
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